SPECIFICATION



To all whom it may concern: 

Be It Known, That I, ANTONIO NEVAREZ, of San Diego, CA, have invented 
certain new and useful improvements in POOLING DATA IN SHARED DATA 
WAREHOUSE, of which I declare the following to be a full, clear and exact description:

POOLING DATA IN A SHARED DATA WAREHOUSE

Background 

Many business entities, such as product retailers and financial institutions, collect and 
maintain large amounts of business information. The types of information that business entities 
routinely collect include data describing the products and services offered by the business 
entities, data describing the amounts and types of transactions involving those products and
services, and data describing the customers who engage in those transactions. 

These business entities often use very large parallel database systems, such as NCR
Corporation's Teradata Active Data Warehouse system, to collect and maintain this raw data.
More importantly, the business entities use data warehousing systems to analyze the raw data,
retrieving and organizing the data in a manner that leads to a more complete and useful
understanding of the business conducted by these entities. In many cases, data warehouses are
used specifically to improve the quality and profitability of product or service delivery to
customers. For example, the data stored in a data warehouse, when retrieved and organized in a
certain manner, allows the owner of that data to profile and classify its customers and then study
purchasing habits within those classes. Profiling in this manner allows the business entity to
make more intelligent decisions in building and pricing its product supply.

In general, the data stored in a data warehousing system is limited to that which the owner
of the system can collect through its normal business channels. Some business entities augment
their data warehouses with purchased data or data obtained from other sources, e.g., through
questionnaires or forms completed by customers on the promise of some benefit received in 
return. This type of data, however, still limits an entity's understanding of its own business or 
the preferences of its own customers. 

Summary

Described below is a system that enables business entities who share a common business 
interest, such as a common set of customers, products, or channels, to pool data collected during 
the course of business and analyze the pooled data. Pooling data in this manner gives each of the 
business entities a more complete understanding of its business than it would otherwise be able
to achieve in analyzing only its own business data. Using an independent third-party broker to
tightly control access to the pooled data ensures that each of the participating business entities
can access only that data to which it is entitled under the data-sharing agreement.

One technique described below is useful in improving the profitability of one or more 
business entities. A computer system receives business-related data from at least two business
entities and stores at least some of the data from each of the business entities in a common
database. Then, in accordance with the terms of an agreement among the business entities, the 
computer system allows at least one of the business entities to receive information gathered from 
the common database. 

Another technique is useful in operating a consortium among a number of distinct business 
entities. The party responsible for managing the consortium collects business-related data
gathered by the business entities. At some point, the business entities enter into an agreement for
pooling the collected data in a shared data warehouse, and the managing party extracts
information from the pooled data in accordance with the terms of this agreement. The managing
party then delivers at least some of the extracted information to each of the business entities in
the consortium.

The consortia described below are implemented, at least in some instances, in computer
networks. The typical network includes one or more channels for use in receiving
business-related data from at least two business entities and a shared database in which at least
some of the business-related data is stored. The shared database includes a mechanism for use in
granting database access only in accordance with the terms of an agreement among the business
entities. The computer network also includes a computer system that gathers information from the
shared database when instructed to do so by a party having access to the database and then
delivers the information to at least one of the business entities.

Other features and advantages will become apparent from the description and claims that
follow.



Brief Description of the Drawings 
FIG. 1 is a schematic diagram of a network through which businesses pool business-related
data in a shared data warehouse.
FIG. 2 is a flowchart of a technique for use in pooling and analyzing business-related data
in a shared data warehouse.

Detailed Description

FIG. 1 shows a network 100 through which multiple business entities 105, 110, 115, 120
share business-related data in a shared data warehouse 125. The business entities may elect to
share virtually any type of data, including consumer data that describes the consumers who
purchase goods or services from the businesses; product data that describes the products or
services delivered by the businesses; and operations data that describes the business operations
carried out by the business entities.

In general, the participating businesses are separate legal entities, such as corporations,
partnerships, and sole proprietorships, which have united under a negotiated agreement to form a
data-sharing group, or consortium. Consortium members are usually unrelated businesses that
share a common interest in some aspect of the business they conduct. For example, the
businesses might share a common set of customers or participate at various stages of delivery of
a common product or service. In many cases, the consortium includes the manufacturer of a
product, a retailer of that product, and a shipper who delivers the product from one to the other.
In some cases, the consortium includes businesses that are related entities, such as a parent
corporation and its wholly- or partially-owned subsidiaries. The business entities that form the
consortium usually are not direct competitors in the business areas for which they are willing to
share data, but they typically harbor enough distrust that they are not willing to provide the other
consortium members with direct or unrestricted access to their business data.

To ensure that each consortium member's access to the shared data is properly restricted
and monitored, the participating businesses 105, 110, 115, 120 typically deliver the data to an
independent, third-party service provider, or broker 130. The broker 130 maintains and often
owns the data warehouse 125 into which the shared data is pooled. Access to the shared data
warehouse 125 is governed by the terms of an agreement negotiated among the consortium
members. The agreement might be negotiated by the consortium members themselves, or the
broker 130 might lead the negotiation.

Some agreements grant direct access to the pooled data only to the broker 130. Other
agreements grant some level of restricted access to the consortium members. For example, the
agreement might allow members to submit queries of only a certain type or of certain scope, or it
might allow queries against only a portion of the data in the warehouse. In most situations, none
of the consortium members is granted full, unrestricted access to the shared data. Conventional
network security and privacy techniques, such as password protection, secure servers, and
permissions, are used to restrict access to the shared data warehouse 125.
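
As an illustration only, the kind of agreement-based restriction described above (queries limited by type and by portion of the warehouse) might be sketched in Python as follows. The rule structure, the party names, the query types "aggregate" and "detail", and the table names are all hypothetical and are not taken from this description.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AccessRule:
    """Hypothetical per-party rule derived from a data-sharing agreement."""
    allowed_query_types: frozenset  # kinds of queries the party may submit
    allowed_tables: frozenset       # portion of the warehouse the party may query


# Hypothetical agreement terms: the broker has broad access, a member does not.
AGREEMENT = {
    "broker": AccessRule(
        frozenset({"aggregate", "detail"}),
        frozenset({"sales", "customers", "shipments"}),
    ),
    "member_a": AccessRule(frozenset({"aggregate"}), frozenset({"sales"})),
}


def is_query_allowed(party, query_type, tables):
    """Return True only if the agreement permits this party's query."""
    rule = AGREEMENT.get(party)
    if rule is None:
        # Parties not named in the agreement get no access at all.
        return False
    return (
        query_type in rule.allowed_query_types
        and set(tables) <= rule.allowed_tables
    )
```

Under these assumed terms, member_a may run an aggregate query against the sales table, but not a detail query and not any query that touches the customers table.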

As described below, the broker 130 and/or the consortium members perform queries 
against the shared data, typically using a database-query tool that is based on a common
database-query protocol, such as the Structured Query Language (SQL). Query results are 
delivered to the consortium members in accordance with the negotiated agreement. 
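
A minimal sketch of an SQL query against pooled data is shown below, using Python's built-in sqlite3 module as a stand-in for a data warehouse. The table name pooled_sales, its columns, and the sample rows are assumptions made for illustration only.

```python
import sqlite3

# In-memory database standing in for the shared data warehouse (assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE pooled_sales "
    "(member TEXT, product TEXT, region TEXT, quantity INTEGER)"
)
# Hypothetical rows contributed by two consortium members.
conn.executemany(
    "INSERT INTO pooled_sales VALUES (?, ?, ?, ?)",
    [
        ("A", "widget", "west", 10),
        ("B", "widget", "west", 5),
        ("B", "gadget", "east", 7),
    ],
)


def total_quantity_by_product(connection):
    """Aggregate quantities across all members' pooled data."""
    rows = connection.execute(
        "SELECT product, SUM(quantity) FROM pooled_sales "
        "GROUP BY product ORDER BY product"
    ).fetchall()
    return dict(rows)


totals = total_quantity_by_product(conn)
```

The aggregate result combines rows from both members without exposing which member contributed which row, which is one way query results could be shaped to fit a negotiated agreement.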

The data stored in the shared data warehouse 125 is derived from a variety of sources, such
as point-of-sale terminals 135, inventory-tracking terminals 140, and other data-collection
systems connected to computer networks 145, 150, 155, 160 maintained by the consortium
members. Many consortium members, such as members C and D in the example of FIG. 1,
maintain their own data warehouses 165, 170 to store and analyze the data collected from these
sources. Other consortium members, such as members A and B in this example, deliver data
directly to the broker's computer network 175 without storing it locally in a data warehouse. In
many cases, the broker 130 maintains separate data warehouses 180, 185 for some of the
consortium members (e.g., members A and B in FIG. 1) and provides data warehousing services
to these members under outsourcing agreements. In these situations, the broker 130 transfers
data from the outsourced data warehouses 180, 185 to the shared data warehouse 125 in
accordance with the terms of the data-sharing agreement. Likewise, the terms of the data-sharing
agreement govern the transfer of data from the private data warehouses 165, 170 of other
consortium members (e.g., members C and D in FIG. 1) into the shared data warehouse 125.
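
One way to picture a transfer that is governed by the agreement is a filter that copies only the agreed fields from a member's private records into the shared pool. The following Python sketch assumes a hypothetical per-member list of shareable fields; the field names and the record layout are illustrative and do not come from this description.

```python
# Hypothetical agreement terms: which fields each member agreed to share.
SHAREABLE_FIELDS = {
    "C": {"product", "region", "quantity"},
    "D": {"product", "quantity"},
}


def extract_shareable(member, records):
    """Keep only the fields the agreement lets this member contribute."""
    allowed = SHAREABLE_FIELDS.get(member, set())
    return [
        {key: value for key, value in record.items() if key in allowed}
        for record in records
    ]


def pool(shared, member, records):
    """Append a member's filtered records to the shared pool, tagged by source."""
    for record in extract_shareable(member, records):
        shared.append({"member": member, **record})


# Usage: a private record with a field that member D did not agree to share.
private_d = [{"product": "widget", "region": "west", "quantity": 3,
              "customer_name": "X"}]
shared_pool = []
pool(shared_pool, "D", private_d)
```

In this sketch the region and customer_name fields never leave member D's private data, because they are not among the fields assumed to be shareable for that member.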

In general, there is no minimum or maximum limit on the amount of data stored in the
shared data warehouse 125. Some consortia pool relatively small amounts of data, e.g., on the
order of tens or hundreds of gigabytes. Many shared data warehouses, however, are very large,
often exceeding tens or even thousands of terabytes (TB), especially when the consortium
members are large national or international companies with many product or service offerings in
many geographic locations. Shared data warehouses, like private data warehouses, also tend to
grow rapidly. Therefore, a highly scalable data warehousing system that is capable of managing
large amounts of data (e.g., hundreds or even thousands of terabytes) is best suited for use in a
data-sharing consortium. The Teradata Active Data Warehouse system offered by NCR
Corporation is one example of such a data warehousing system.



NCR Docket 9049.00 



In general, the shared data warehouse 125 includes one or more computing systems 190 in
the broker's network 175 that maintain the shared data in a storage facility 195, such as an array
of magnetic or optical disks. Functions performed by the computing systems 190 include
placing data into the shared warehouse, removing data from the warehouse, analyzing the pooled
data, generating reports, and delivering the reports to the consortium members. The computing
systems 190 also restrict access to the pooled data, applying a set of rules that grant access only
in a manner that is consistent with the terms of the negotiated agreement.

The computing systems 190 operate under the control of one or more computer programs
199, each consisting of computer-readable instructions stored on a persistent storage medium,
such as a CD-ROM or a hard disk. In executing one of the programs, a computing system copies
the instructions from the persistent storage medium into a temporary storage module, such as a
random access memory (RAM) chip, and then executes the instructions from there.

FIG. 2 shows a technique for creating and using a shared data warehouse. Either the
third-party broker or a prospective consortium member builds the consortium by approaching other
prospective members and negotiating a data-sharing agreement among them (step 200). Once
the consortium has been established, the members collect data about certain aspects of their
businesses, such as the types, quantities, and prices of items sold in various geographic locations
(step 205). The members periodically deliver this data to the third-party broker, in accordance
with the terms of the data-sharing agreement (step 210). The broker pools this data into the
shared data warehouse (step 215). In general, the broker governs access to the data in the shared
data warehouse.

When instructed by the consortium members, or at times previously agreed upon by the
consortium members, the broker analyzes the pooled data in accordance with the terms of the
data-sharing agreement (step 220). Analyzing the data usually involves performing a series of
queries (e.g., SQL-based queries) against the shared data. The broker tailors the analysis, as well
as the analysis results, to the requests of the individual consortium members (step 225). For
example, one member might request information about the types of customers that purchase
certain products ("customer profiling"), while another member might request information
indicating whether any nexus exists between purchases of different types of products ("shopping
cart analysis"). The broker then delivers the analysis results to the consortium members (step
230). In general, the consortium members store the analysis results locally, such as in data
warehouses maintained by the individual consortium members (step 235).
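
The "shopping cart analysis" mentioned above can be illustrated, in its simplest form, by counting how often pairs of products appear in the same purchase. The Python sketch below is one elementary way to look for such a nexus. The basket data and the function name are hypothetical, and this description does not prescribe any particular analysis method.

```python
from collections import Counter
from itertools import combinations


def co_purchase_counts(baskets):
    """Count how many baskets contain each unordered pair of products."""
    counts = Counter()
    for basket in baskets:
        # De-duplicate and sort so each pair has one canonical ordering.
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    return counts


# Hypothetical pooled transactions, each listing the products bought together.
baskets = [
    ["bread", "milk"],
    ["bread", "milk", "eggs"],
    ["eggs", "milk"],
]
pairs = co_purchase_counts(baskets)
```

Pairs with high counts relative to the number of baskets suggest a link between the products. A production analysis would normally also account for how common each product is on its own.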

In some embodiments, the broker delivers the analysis results to the consortium members 
in the form of reports. These reports are often generated from templates created by the broker in 
conjunction with the consortium members. Using report templates ensures that the consortium
members all know in advance what type of information they can receive and in what form they 
will receive that information. 
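
A report template of the kind described above could be as simple as a fixed text layout with named placeholders. The sketch below uses Python's standard string.Template class. The layout and the field names (member, product, units) are assumptions chosen for illustration.

```python
from string import Template

# Hypothetical template agreed on in advance by the broker and the members.
REPORT_TEMPLATE = Template(
    "Report for $member\n"
    "Product: $product\n"
    "Total units sold: $units\n"
)


def render_report(member, product, units):
    """Fill the agreed template with one member's analysis results."""
    # substitute() raises KeyError if a placeholder is left unfilled, so a
    # report can never be delivered with a missing field.
    return REPORT_TEMPLATE.substitute(member=member, product=product, units=units)


report = render_report("Member A", "widget", 15)
```

Because the template is fixed ahead of time, every member can see exactly which fields a report will contain before any data is analyzed.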

Sharing data in the manner described here provides many advantages to members of the
data-sharing consortium, as well as to consumers. Members of a data-sharing consortium have
access to more sources and types of business-related data than do their competitors. This gives
the consortium members enhanced business knowledge and, thus, a competitive advantage over
other companies. Sharing data also helps the participating businesses to discover unrecognized
business opportunities. For example, a data-sharing arrangement between a manufacturer and a
shipping company can help the shipping company increase its load efficiency, thus increasing its
profits and reducing the manufacturer's shipping costs. Consumers benefit from these
data-sharing arrangements through improved delivery and lower cost of products and services.

Using a third-party broker to maintain the shared data warehouse ensures the security and
privacy of the raw data contributed by consortium members. A third-party broker also helps
ensure adherence to the terms of the data-sharing agreement and fairness in distributing the costs
and benefits of the shared data warehouse.

The text above describes the invention in terms of one or more specific embodiments. The
invention also appears in a variety of alternative embodiments and thus is not limited to the
embodiments described above. For example, in some embodiments, the third-party broker
maintains more than one shared data warehouse for each data-sharing consortium. The broker
also frequently maintains shared data warehouses for more than one data-sharing consortium. In
many cases, a particular business entity is a member of multiple consortia. The business entity
might, in this situation, authorize the broker to place all or some portion of its data into the
shared data warehouses for more than one of the consortia in which the business participates. In
some embodiments, one or more of the consortium members, and not an independent third party,
maintains the shared data warehouse. Accordingly, embodiments other than those described here
are within the scope of the following claims.

Claims

What is claimed is: 

1. A method for use in improving profitability of one or more business entities, the method
comprising:

receiving business-related data from at least two business entities;

storing at least some of the data from each of the business entities in a common database;
and

in accordance with the terms of an agreement among the business entities, allowing at least
one of the business entities to receive information gathered from the common database.

2. The method of claim 1, where allowing one of the business entities to receive 
information gathered from the common database includes gathering at least some of the 
information from data received from a business entity other than the one receiving the 
information. 

3. The method of claim 1, where allowing one of the business entities to receive 
information gathered from the common database includes granting that business entity 
permission to gather the information directly from the common database. 

4. The method of claim 1, where allowing one of the business entities to receive 
information gathered from the common database includes gathering the information on behalf of 
that business entity and then delivering the information to the business entity. 

5. The method of claim 1, where storing data in the common database includes storing the 
data in a database maintained by an entity in the business of providing data warehousing
services. 

6. The method of claim 1, further comprising granting unrestricted access to the common 
database only to a third party named in the agreement among the business entities.

7. The method of claim 1, where storing data in the common database includes storing the
data in a scalable data warehouse.

8. The method of claim 1, where storing data in the common database includes storing the
data in a data warehouse having a total capacity of at least approximately one terabyte.

9. The method of claim 1, where storing data in the common database includes storing the
data in a data warehouse having a total capacity of hundreds of terabytes.

10. The method of claim 1, where receiving business-related data includes receiving data 
that describes the customers of at least one of the business entities.

11. The method of claim 1, where receiving business-related data includes receiving data 
that describes the products offered by at least one of the business entities. 

12. The method of claim 1, where receiving business-related data includes receiving data 
that describes the business operations of at least one of the business entities. 

13. The method of claim 1, where receiving business-related data includes receiving a
combination of at least two of the following types of data from the business entities: consumer
data, product data, and operations data.

14. The method of claim 1, where receiving business-related data includes occasionally
receiving new data from the business entities. 

15. The method of claim 1, where receiving business-related data includes receiving data 
that is also stored in a private data warehouse maintained by one of the business entities.

16. The method of claim 1, where receiving business-related data includes receiving data
that is also stored in a private data warehouse maintained on behalf of one of the business entities
under a data-warehouse services agreement.

17. The method of claim 1, further comprising negotiating the agreement among the
business entities in the group.

18. A method for use in operating a consortium among a number of distinct business
entities, the method comprising:

collecting business-related data gathered by the business entities;

negotiating an agreement among the entities for pooling the collected data in a shared data
warehouse;

extracting information from the pooled data in accordance with terms of the negotiated
agreement; and

delivering at least some of the extracted information to each of the business entities in the
consortium.

19. The method of claim 18, where collecting data includes pooling the data into a data
warehousing system owned by a third party who does not contribute any of the pooled data.

20. The method of claim 18, where collecting data includes pooling the data into a data 
warehousing system owned by a third party who is in the business of providing data warehousing 
services. 

21. The method of claim 18, where negotiating the agreement includes drafting the
agreement to include terms governing the types of data to be placed in the shared data
warehouse.

22. The method of claim 18, where negotiating the agreement includes drafting the
agreement to include terms governing access to the shared data warehouse.

23. The method of claim 18, where negotiating the agreement includes drafting the
agreement to include terms governing the types of information that can be extracted from the
shared data warehouse.

24. The method of claim 18, where negotiating the agreement includes drafting the
agreement to include terms providing for payment of money in exchange for services provided
by a third party selected to maintain the shared data warehouse.

25. A computer network comprising:

one or more channels for use in receiving business-related data from at least two business
entities; and

a shared database comprising:

    a storage facility in which at least some of the data received from each of the business
    entities is stored;

    a mechanism for use in granting database access only in accordance with the terms of
    an agreement among the business entities; and

    a computer system configured to:

        gather information from the shared database when instructed to do so by a party
        having access to the database; and

        deliver the information to at least one of the business entities.

26. A computer program, stored on a computer-readable medium, for use in analyzing
business-related data, the program comprising executable instructions that, when executed by a
computer, cause the computer to:

    receive business-related data from at least two business entities;

    store at least some of the data received from each of the business entities in a shared
    database;

    grant database access only in accordance with the terms of an agreement among the
    business entities;

    gather information from the shared database when instructed to do so by a party
    having access to the database; and

    deliver the information to at least one of the business entities.

Abstract

A computer network is used to improve the profitability of one or more business entities. 
The network receives business-related data from at least two business entities and stores at least 
some of the data from each of the business entities in a common database. Then, in accordance 
with the terms of an agreement among the business entities, the network allows at least one of 
the business entities to receive information gathered from the common database. 